A comparison between semi-supervised and supervised text mining techniques on detecting irony in greek political tweets

نویسندگان

  • Basilis Charalampakis
  • Dimitris Spathis
  • Elias Kouslis
  • Katia Kermanidis
چکیده

The present work describes a classification schema for irony detection in Greek political tweets. Our hypothesis states that humorous political tweets could predict actual election results. The irony detection concept is based on subjective perceptions, so only relying on human-annotator driven labor might not be the best route. The proposed approach relies on limited labeled training data, thus a semi-supervised approach is followed, where collective-learning algorithms take both labeled and unlabeled data into consideration. We compare the semi-supervised results with the supervised ones from a previous research of ours. The hypothesis is evaluated via a correlation study between the irony that a party receives on Twitter, its respective actual election results during the Greek parliamentary elections of May 2012, and the difference between these results and the ones of the preceding elections of 2009. & 2016 Elsevier Ltd. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Detecção semi-supervisionada de posicionamento em tweets baseada em regras de sentimento

Stance detection aims to automatically identify if the text author is in favor or against a subject or target. This work describes a semi-supervised method for stance detection. The core is a set of rules to identify stance based on positive or negative opinions of targets directly or indirectly related. Tweets automatically labeled using the rules compose a training corpus for a supervised app...

متن کامل

Microblog Emotion Classification by Computing Similarity in Text, Time, and Space

Most work in NLP analysing microblogs focuses on textual content thus neglecting temporal and spatial information. We present a new interdisciplinary method for emotion classification that combines linguistic, temporal, and spatial information into a single metric. We create a graph of labeled and unlabeled tweets that encodes the relations between neighboring tweets with respect to their emoti...

متن کامل

Mining User Intents in Twitter: A Semi-Supervised Approach to Inferring Intent Categories for Tweets

In this paper, we propose to study the problem of identifying and classifying tweets into intent categories. For example, a tweet “I wanna buy a new car” indicates the user’s intent for buying a car. Identifying such intent tweets will have great commercial value among others. In particular, it is important that we can distinguish different types of intent tweets. We propose to classify intent ...

متن کامل

Emotion Detection in Persian Text; A Machine Learning Model

This study aimed to develop a computational model for recognition of emotion in Persian text as a supervised machine learning problem. We considered Pluthchik emotion model as supervised learning criteria and Support Vector Machine (SVM) as baseline classifier. We also used NRC lexicon and contextual features as training data and components of the model. One hundred selected texts including pol...

متن کامل

On Classifying the Political Sentiment of Tweets

For this project, we attempted to classify the political sentiment of tweets containing the case-insensitive string ‘Obama’ in an effort to automatically gauge the public opinion of US President Barack Obama. To accomplish this goal we investigated rule-based, supervised, and semi-supervised learning methods. Our main approach involved bootstrapping an ngram-feature-based maximum entropy classi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Eng. Appl. of AI

دوره 51  شماره 

صفحات  -

تاریخ انتشار 2016